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ABSTRACT 



Classroom interaction data matrices obtained from a 



preservice training project in techniques of achieving divergent 
pupil responses were compared using a statistical procedure outlined 
by Darwin (1959). The procedure interprets interaction sequences as 
realizations of Markov chains. Contrary to results of an earlier 
study (Pena, 1972), interaction matrices were found to satisfy the 
dependency assumptions for Markov chains. Significant differences 
were found between interaction sequences for classes at different 
levels and between trained and untrained student teachers. 
Within-teacher comparisons were not significant. Pena f s conclusion 
that the tests are too powerful is criticized on logical and 
methodological grounds. (Authors) 
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MARKOV CHAIN ANALYSIS OP CLASSRCCM INTERACTION DATA* 



I ? Introduction 

In 196l, Ned Flanders described a technique for class- 
ifying and quantifying sequences of verbal behavior in the 
classroom. Verbal behavior is classified by a trained ob- 
server and then coded into a square interaction matrix. 
Each entry in an interaction matrix represents the frequency 
with which the row category is followed by the column cat- 
egory* Little work has been done in the quantitative anal- 
ysis of classroom interaction data, aside from the deriva- 
tion of several ad hoc indices such as the direct-indirect 
influence ratio* In 1959* Darwin derived a series of like- 
lihood ratio criteria for comparing two or more realiza- 
tions of Markov, or one-dependent probability chains . 
Darwin illustrated their use with some of Flanders 1 early . 
classroom interaction data. In order for Darwin's tests 
to be applicable, however, the chains must possess the 
properties of a Markov chain. 

Based on the likelihood ratio criteria developed by 
Koel (195*0 to test the length of dependence of a proba- 
bility chain, Pena (1972) reported that the chains result- 
ing from classroom interaction observations do not satisfy 

*The authors wish- to acknowledge the cooperation of Dr. 
Charles E. Gray and Dr. Richard C. Youngs for contributing 
the data analyzed in this paper. Data were originally col- 
lected for Experimental Project 178, "Preservice Teacher 
Training and Creativity in Problem Solving: A Developmental 
Investigation/ 1 August, 1971* 



the requirements of a Markov chain; that is, Pena concluded 
that the probability of occurence of any verbal classroom 
behavior appears to depend, not on the immediately preced- 
ing behavior, but rather on the two preceding behaviors. 
Pena also concluded that Darwin 1 s tests were too powerful 
because the tests often identified interaction matrices as 
different when the matrices resulted from observations of 
"educationally homogeneous" classrooms. Her criteria for 
identifying "educationally homogeneous" classrooms, however, 
were comparable mean scores on a math achievement test which 
are affected by many things other than verbal interactions 
within a classroom. 

The results reported by Pena concerning the Markovian 
properties of interaction data are misleading. In order to 
obtain long chains for a single teacher, she combined data 
across five different subject areas; in order to obtain 
long chains for a single subject area, she combined data 
across five different teachers. Such additive procedures 
would be warranted only if differences in interaction 
matrices were shown to be independent of differences in sub- 
ject matter and differences in teachers. Furthermore, by 
combining chains across teachers and across subject areas, 
Pena analyzed chains which ranged in length from 2,398 to 11,756 
tallies (2-10 hours). Although the question of what con- 
stitutes an "observation" is somewhat ambiguous, it seems 
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unlikely that continuous interaction sequences of this 
length would occur naturally in ordinary classroom settings. 

II. Method 

In the analysis reported here chains ranging in 
length from 167-5^ tallies (8-27 minutes) were first 
tested for length of dependence, u, (Hoel, 195^ ), then 
tested for the equality of transition and occupation pro- 
babilities using the four likelihood ratio criteria de- 
rived by Darwin (1959) • The data were obtained from a 
training project which had as its intent the establish- 
ment of teaching skills resulting in increased divergent 
pupil production In problem- solving* 

Five sets of data were selected involving four stu- 
dent-teachers participating in the training project 0 
Two of the teachers were observed in kindergarten classes^ 
one a member of a training group, the other a control . 
Two of the teachers were observed in twelfth grade classes, 
again representing a training and a control group. A 
second observation for the control kindergarten teacher 
was included to provide within-subject comparisons. Ob- 
servations were recorded on videotape and coded by a 
single observer who verified doubtful classifications by 
repeated viewings 0 The chains of behaviors resulting 
from these observations were each coded into a 9 x 9 
matrix. Original observations were based on a more complex 



coding system (Araidoa, Amidon and Rosenshine, 1969), but 
many categories had no entries in any row or column and 
were subsequently eliminated; other subcategories had few 
entries and were combined with closely related subcate- 
gories in order to simplify the analysis. 

insert table 1 

Hoel f s test for the length of dependence (u) of a 
probability chains employs a likelihood ratio criterion 
distributed as chi-square* The Hoel procedure involves 
choosing a probable length of dependence, testing for 
significance and then decreasing u by one and testing for 
significance again. The procedure is stopped at the 
point where u is not significant but u-1 is significant. 
Each of the five interaction chains used in this analysis 
was tested for two-dependence and one-dependence. In all 
five cases the X 2 criteria (converted to Z) for two- 
dependence were not significant, while the X 2 criteria 
for one-dependence were significant. 

insert table 2 

These results indicated that each of the five chains used 
in this analysis is a realization of a Markov chain, and 
consequently, that the Darwin tests would be applicable. 
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The Darwin procedure provides four likelihood ratio 
criterion tests for comparing the equality of any number 
of realizations of a Markov chain© However, the present 
analysis considered only the case in which two realiza- 
tions are tested for equality* Briefly, Darwin's four 
likelihood ratio criteria test the equality of : 

1, two complete sets of transition probabilities, 
Pjk* 

2, two off -diagonal sets of transition probabil- 
ities, p.^, regardless of the diagonal values, 

3» two diagonal sets of transition probabilities, 
PjJ* regardless of the off-diagonal values, 
Pjk* 

.1 

k* two sets of occupation probabilities, Pj, or the 
probability of occurence of a behavior in any 
category* 

The criteria values of tests 2 # and 3* BTe addi- 
tively equal to the criterion value of test !• Test U, 
for the equality of two sets of occupation probabilities 
may seem redundant since the Pj values are related to the 
Pjk values* However, Darwin points out that it is pos- 
sible for two realizations of a Markov chain to differ so 
slightly that the difference will not be detected by test 1, 
but that the particular functions of these differences as 
reflected in the Pj mey result in the significance of 
the criterion value of test k* 

The four Darwin tests were applied to all six possible 
between-teacher comparisons and the one within-teacher 
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'comparison* 

III, Results and Discussion 

All four likelihood ratios were significant (p>.001) 
for the six between-teacher comparisons* 

insert table 3 

In these six comparisons, between 61-87$ of the first cri- 
terion value was accounted for by test 2, indicating that 
these matrices had more pronounced differences in their off- 
diagonal entries than in their steady-states* In the within- 
teacher comparisons, the first three tests revealed no 
significant overall differences (pC*10), but the fourth like- 
lihood ratio, testing the equality of occupation probabilities 
was significant (p> •001)* These results suggest that inter- 
action matrices may reflect within-subject consistency and 
that general activity patterns are essentially the same for a 
single teacher. Occupation probabilities, however, reflect 
differences in the time spent in each category of behavior, 
and such time differences may vary from situation to situa- 
tion for a particular teacher. 

Another noteworthy feature of the data is that the cri- 
teria values associated with between-grade, within-training 
comparisons were greater in magnitude than the criteria values 
associated with between-training, within-grade comparisons* 
Thus the data confirm the common sense assumption that grade 



level differences are more important than training group 
differences to the patterns of verbal behavior represented 
in an interaction matrix. In the original study from 
which the data were obtained, the grade level effect was 
also more pronounced than the training group effect in 
regards to the number of hypotheses generated per minute 
(hpm) during an observation. Rank order correlations 
(Kendall's Tau) were computed between each of the four 
sets of likelihood ratio criteria and the absolute dif- 
ference in the number of hypotheses per minute (hpm) for 
each of the six between-teacher pairs. 

insert table h 

Since the magnitude of a likelihood ratio criterion re- 
presents the degree of discrepancy between two realizations 
of a Markov chain, these correlations suggest that the 
greater the difference in the off -diagonal entries of two 
interaction matrices, the greater the difference in the 
nuiiber of hypotheses generated per minute in the two class- 
rooms. A relatively strong relationship also appears to 
exist between differences in the total proportion of time 
spent in any behavioral category and differences in the 
hpms for two classrooms. 

Pena also pointed out that one factor influencing the 
magnitude of a likelihood ratio criterion is the length of 
the chains used in an analysis * Rank order correlations 



between the total length of each pair of realizations and 
the four sets of likelihood ratio criteria were also com- 
puted and results tended to confirm Pena f s statement, with 
the exception of likelihood ratio criterion 2, the test for 
the equality cf off -diagonal probabilities. 

insert table 5 

Fewer entries in the off -diagonal cells appear to be some- 
what related to large criterion values. 

There is reason to believe that in sequences of ordin- 
ary lengths, chains derived from interaction analysis ob- 
servations are one-dependent or Markovian in nature , and 
consequently the procedure outlined by Darwin may be a use- 
ful method of quantitative analysis for the dynamics of 
classroom behavior. Differences in the magnitude of like- 
lihood ratio criteria compared across grades and across 
treatment groups indicate that grade l^vel has a greater 
influence on general patterns of verbal interactions than 
dos6 training* Since within-subject coinparisons were not 
significant in three of the four tests, aMarkov analysis 
of classroom interaction data seem6 a potentially suitable 
method for describing the stable characteristics of a 
single teacher, and adding chains of observations across 
different teachers should not be done without previous 
testing to assure that all teachers' characteristics are 
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similar. 



Although the procedures used in this anlysis do .offer 



a foundation for further theoretical work in the stu'y of 
teaching, several issues of practical and theoretic^el in- 
terest remain unresolved. In this study, a relationship 
was found between differences in interaction sequences and 
the number of hpms, but analysis by a Markovian model was 
a PQS'fc hoc procedure: data collection procedures were not* 1 
designed to examine, or even reveal, relationships of this 
kind. Further theoretical attention should be given to 
appropriate choice of criteria. The question raised by 
Pena's conclusion that the Darwin tests are too rowerful 
also remains unresolved and is confounded by the apparent 
relationship between the length of the chains used in the 
analysis and the resulting criterion value. An answer to 
this question would seem to depend upon a combination of a 
theory of teaching relatable to a Markov model and Monte 
Carlo studies of error rates for selected transition and 
occupation probability parameter values. 
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APPENDIX OF TABLES 



ERIC 



TABLE 1 



EXPANDED INTERACTION ANALYSIS CATEGORY SYSTEM 



TEACHER TALK 



1« ACCEPTS STUDENT FEELINGS 

la - Acknowledges feelings 

1c - Clarified feelings,, 

lr - Refers to similar feelings of others 



Combined 



f*2 # PRAISES 

2w - Without criteria 
2p - With public criteria 
2p - With private criteria 

3. ACCEPTS STUDENT IDEAS 

3a - Acknowledges ideas 
3c - Clarifies ideas 
3s - Summarizes ideas 

^ ASKS QUESTIONS 

- Factual questions 
he - Convergent questions 
*i*d - Divergent questions 
*ke - Evaluative questions 



*5. 



Combined 



6. 



LECTURES 

5f - Factual lecture 
5m - Motivational lecture 
5o m Orientational lecture 
ip - Personal opinion lecture 



GIVES DIRECTIONS 
6c - Cognitive directions 
6m - Managerial directions 



7. CRITICIZES 

7w - Without criteria 
TP - With public criteria 
7p - With private criteria 



STUDENT TALK 



Combined 



8. STUDENT TALK, PREDICTABLE 
*8f - Factual student talk 
8c - Convergent student talk 

*9. STUDENT TALK, UNPREDICTABLE 

9d - Divergent student talk 
"9e - Evaluative student talk 
m student-initiated talk 



*Used in present 
analysis 



Table 1, Continued 



NO TALK *10. SILENCE CP CONFUSION 

(108 - Silence 
Combined J 10c - Confusion 

\jlo - A change of speakers in student-to-student 
interaction, end the beginning and end of a 
coding sequence in matrix construction. 



From Edmund Anddon, Peggy Axnidon, and Barak Roaenshine, SKIT Work Manual, 
Minneapolis: Association for Productive Teaching, 1969, p« 13* 
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TABLE 2 

RESULTS OF THE HOEL TEST FOR THE LENGTH OF DEPENDENCE 
OF A PROBABILITY CHAIN 







EXPERIMENTAL 


LENGTH OF CHAIN 










TEACHER 


GRADE 


GROUP 


(No. of tallies) 


u 


X 2 


df 


Z* 


1 


Kindergarten 


Control 


325 


2 


241.91 


576 


-11.93 








325 


1 


3^.19 


72 


lk.28 


1 


Kindergarten 


Control 


167 


2 


1U6.73 


576 


- 7.63 








167 


1 


228.32 


72 


9.62 


2 


Kindergarten 


Training 


307 


2 


191.89 


576 


-14.3U 








307 


1 


293.96 


72 


12.29 


3 


12th 


Control 


323 


2 


23U.83 


576 


-12.26 








323 


1 


37^.59 


72 


15.^1 


k 


12th 


Training 


5W 


2 


93.57 


576 


-20.25 






5Uh | 1 


816.08 


72 


28.1* 



* 2 ~ v) 2X 2 - si 2df -1 . X 2 converted to Z due to df > 70 . 
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TABLE k 

RANK ORDER CORRELATIONS BETWEEN ABSOLUTE DIFFERENCES 
IN HPMS OF ALL BETWEEN-TEACHER COMPARISONS AND 
THEIR RESPECTIVE LIKELIHOOD RATIO CRITERIA 

hpnib-hpnia 



A. LRC 




1 


2 


3 


1* 


1 1 


- 


1.17 


.81 


1.1+1 




2 i 


232 




.36 


.24 


Criteria 










60 


Values 


3 i 

! 


133 






\ 

k i 




138 


194 




.B, LRC2 




1 


2 


3 


1+ 




•i 

1 


- 


1.17 


.81 


l.4l 




2 


150 




.36 


.24 


Criteria 










• DO 


Values 


3 


103 






1* 


i 160 




133 




C. LRC 3 




1 


2 


3 




1 


» 


1.17 


.81 


i.4i 




2 


! 82 




.36 


.21* 


Criteria 










.60 


Values 


3 


j 29 






h 


i 2h 


44 


61 




D* LRCI4 




1 


2 


3 




1 


i 


1.17 


.81 


1.4i 




2 


| 89 




.36 


.21* 


Criteria 






94 




.60 


Values 


3 


| ,8 






k 


i h9 


51 


37 





!T Y= .07 



rr T= .60 



'•T' - - »33 



.33 



TABLE 5 

RANK ORDER CORRELATIONS BETWEEN TOTAL 
LENGTHS OF BETWEEN-TEACHER PAIRS 
OP CHAINS AND THEIR RESPECTIVE 
LIKELIHOOD RATIO CRITERIA 



Total Length of Pairs 



A. LRCi 




1 


2 


3 


h 






1 


> 

i 


81*0 


627 


628 




Criteria 


2 

3 


1 

! 232 

1 133 

t 


2^3 


868 


876 

656 






u 




138 


194 


mm 




Bo LRCg . 




1 


2 


3 


k 






1 




8U0 


627 


628 




Criteria 

V tUUcB 


2 

5 


150 

j J.UJ 


IhO 


868 

mm 


876 


r- 




k 


1 160 




133 






C • LRC j 




1 


2 


3 


i* 






1 






627 


628 




Criteria 
Values 


2 
3 


82 

i 

;■ 29 


<* 


868 


876 
656 






k 


j 2k 




61 






D* LRC^ 




1 


2 


3 


h 






1 


i 


8U0 


627 


628 




Criteria 
Values 


2 
3 


!89 
j 1*8 




868 


876 
656 






k 


1 ^9 


51 


37 







